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<210> 1 

<211> 3252 

<212> DNA 

<213> Homo sapiens 



<400> 1 

gcgggagtgc tgcctgccca cggcacccag cacggcatcc ggctgcccct gcgcagcggc 6 0 
ct ggggggcg cccccctggg gctgcggctg ccccgggaga ccgacgaaga gcccgaggag 120 
cccggccgga ggggcagctt tgtggagatg gtggacaacc tgaggggcaa gtcggggcag 180 
ggctactacg tggagatgac cgtgggcagc cccccgcaga cgctcaacat cctggtggat 240 
acaggcagca gtaactttgc agtgggtgct gccccccacc ccttcctgca tcgctactac 300 
cagaggcagc tgtccagcac ataccgggac ctccggaagg gtgtgtatgt gccctacacc 36 0 
cagggcaagt gggaagggga gctgggcacc gacctggtaa gcatccccca tggccccaac 42 0 
gtcactgtgc gtgccaacat tgctgccatc actgaatcag acaagttctt catcaacggc 480 
tccaactggg aaggcatcct ggggctggcc tatgctgaga ttgccaggcc tgacgactcc 540 
ctggagcctt tctttgactc tctggtaaag cagacccacg ttcccaacct cttctccctg 600 



1 



cagctttgtg gtgctggctt ccccctcaac 
agcatgatca ttggaggtat cgaccactcg 
atccggcggg agtggtatta tgaggtgatc 
ctgaaaatgg actgcaagga gtacaactat 
aaccttcgtt tgcccaagaa agtgtttgaa 
tccacggaga agttccctga tggtttctgg 
ggcaccaccc cttggaacat tttcccagtc 
aaccagtcct tccgcatcac catccttccg 
gccacgtccc aagacgactg ttacaagttt 
atgggagctg ttatcatgga gggcttctac 
ggctttgctg tcagcgcttg ccatgtgcac 
ccttttgtca ccttggacat ggaagactgt 
accctcatga ccatagccta tgtcatggct 
tgcctcatgg tgtgtcagtg gcgctgcctc 
gctgatgaca tctccctgct gaagtgagga 
ggaccacacc tccgtggttc actttggtca 
cagagcacct caggaccctc cccacccacc 
ggctggcaag gtgggttcca gggactgtac 
cactctgctg gcgggaatac tcttggtcac 
tgaaacttca gccctgaacc tttgtccacc 
cttcttttct tagtttcaga agtactggca 
tgtggtaccc tggcagagaa gagaccaagc 
gaggatgcac agtttgctat ttgctttaga 
ggtgcaaaga ttgcctcttg aattaaaaaa 
gcggctggaa agaggagaag gagagggagt 
aggaaaggca gaaacacaac cactcaccag 
catcccatct cagaagatgg gtgttgtttt 
accaaaagtg agatgggaag ggcttatcta 
tgaagtgccc actaagaagt tccacttaac 
tctctatctg aaccaccctt tattctacat 
cctaagctcc aggtgccctg tgggagagca 
tcctggtcat aggctcactc tttcccccaa 
ctaaaaggaa taggtaggag acctcttcta 
tcattcaaca gctgatgccc tataacccct 
agtagcaaga tctttacata attcagagtg 
cccctccatt tatttgacta aagcatcrca 
agaaatacag tgctttatgg ctctaacatt 
aaggatggca gcctcagggc ttccttatgt 
catctttttc ccctatcctg ttcttcccct 
gctggttctt gggctaggta gtggggacca 
agtaaactac ggtaccagtg ttagtgggaa 
tcctactcct acctggtcaa cccgctgctt 
acctgataag ggagagggaa atacaaggag 
gcccmcaagc cataaaccaa taaamcaaga 
aaaaaaaaaa aa 



cagtctgaag tgctggcctc tgtcggaggg 660 
ctgtacacag gcagtctctg gtatacaccc 720 
attgtgcggg tggagatcaa tggacaggat 780 
gacaagagca ttgtggacag tggcaccacc 840 
gctgcagtca aatccatcaa ggcagcctcc 900 
ctaggagagc agctggtgtg ctggcaagca 960 
atctcactct acctaatggg tgaggttacc 1020 
cagcaatacc tgcggccagt ggaagatgtg 10 80 
gccatctcac agtcatccac gggcactgtt 1140 
gttgtctttg atcgggcccg aaaacgaatt 12 00 
gatgagttca ggacggcagc ggtggaaggc 12 60 
ggctacaaca ttccacagac agatgagtca 1320 
gccatctgcg ccctcttcat gctgccactc 13 80 
cgctgcctgc gccagcagca tgatgacttt 1440 
ggcccatggg cagaagatag agattcccct 1500 
caagtaggag acacagatgg cacctgtggc 1560 
aaatgcctct gccttgatgg agaaggaaaa 1620 
ctgtaggaaa cagaaaagag aagaaagaag 1680 
ctcaaattta agtcgggaaa ttctgctgct 1740 
attcctttaa attctccaac ccaaagtatt 1800 
tcacacgcag gttaccttgg cgtgtgtccc 1860 
ttgtttccct gctggccaaa gtcagtagga 1920 
gacagggact gtataaacaa gcctaacatt 1980 
aaactagatt gactatttat acaaatgggg 2 040 
acaaagacag ggaatagtgg gatcaaagct 2100 
tcctagtttt agacctcatc tccaagatag 2160 
caatgttttc ttttctgtgg ttgcagcctg 2220 
gccaaagagc tcttttttag ctctcttaaa 2280 
acatgaattt ctgccatatt aatttcattg 2340 
atgataggca gcactgaaat atcctaaccc 2400 
actggactat agcagggctg ggctctgtct 2460 
atcttcctct ggagctttgc agccaaggtg 2 520 
tctaatcctt aaaagcataa tgttgaacat 2580 
gcctggattt cttcctatta ggctataaga 2640 
gtttcattgc cttcctaccc tctctaatgg 2700 
cagtggcact agcattatac caagagtatg 2760 
actgccttca gtatcaaggc tgcctggaga 2820 
cctccaccac aagagctcct tgatgaaggt 2 880 
ccccgctcct aatggtacgt gggtacccag 2 940 
agttcattac ctccctatca gttctagcat 3 000 
gagctgggtt ttcctagtat acccactgca 3060 
ccaggtatgg gacctgctaa gtgtggaatt 3120 
ggcctctggt gttcctggcc tcagccagct 3180 
atactgagtc taaaaaaaaa aaaaaaaaaa 3240 

3252 



<210> 2 



2 



<211> 488 
<212> PRT 
<213> Homo sapiens 



<220> 

<223> Purified Memapsin 2 



<220> 

<223> Amino Acids 28-48 are remnant putative propeptide 
residues 



<220> 

<223> Amino Acids 58-61/ 78, 80, 82-83, 116, 118-121, 
156, 166, 174, 246, 274, 276, 278-281, 283, and 
376-377 are residues in contact with the OM99-2 
inhibitor 



<220> 

<223> Amino acids 54-57, 61-68, 73-80, 86-89, 109-111, 
113-118, 123-134, 143-154, 165-168, 198-202, and 
220-224 are N-lobe Beta Strands 

<220> 

<223> Amino Acids 184-191 and 210-217 are N-lobe Helices 
<220> 

<223> Amino acids 237-240, 247-249, 251-256, 259-260, 
273-275, 282-285, 316-318, 331-336, 342-348, 
354-357, 366-370, 372-375, 380-383, 390-395, 
400-405, and 418-420 are C-lobe Beta Strands 

<220> 

<223> Amino Acids 286-299, 307-310, 350-353, 384-387, 
and 427-431 are C-lobe Helices 



<400> 2 

Ala Gly Val Leu 
1 

Leu Arg Ser Gly 
20 

Glu Thr Asp Glu 
35 

Glu Met Val Asp 
50 



Pro Ala His Gly 
5 

Leu Gly Gly Ala 



Glu Pro Glu Glu 
40 

Asn Leu Arg Gly 
55 



Thr Gin His Gly 
10 

Pro Leu Gly Leu 
25 

Pro Gly Arg Arg 



Lys Ser Gly Gin 
60 



lie Arg Leu Pro 
15 

Arg Leu Pro Arg 
30 

Gly Ser Phe Val 
45 

Gly Tyr Tyr Val 



3 



Glu Met Thr Val Gly Ser Pro Pro Gin Thr Leu Asn He Leu Val Asp 
65 70 75 80 

Thr Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu 
85 90 95 

His Arg Tyr Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg 
100 105 no 

Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu 
115 120 125 

Gly Thr Asp Leu Val Ser He Pro His Gly Pro Asn Val Thr Val Arg 
130 135 140 

Ala Asn He Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly 
145 150 155 160 

Ser Asn Trp Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg 
165 170 175 

Pro Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr 
180 185 190 

His Val Pro Asn Leu Phe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro 
195 200 205 

Leu Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met He He 
210 215 220 

Gly Gly He Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro 
225 230 235 240 

He Arg Arg Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu He 
245 250 255 

Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys 
260 265 270 

Ser He Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val 
275 280 285 

Phe Glu Ala Ala Val Lys Ser He Lys Ala Ala Ser Ser Thr Glu Lys 
290 295 300 

Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala 
305 310 315 320 
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Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met 
325 330 335 



Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr He Leu Pro Gin Gin 
340 345 350 

Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr 
355 360 365 

Lys Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val 
370 375 380 

He Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He 
385 390 395 400 

Gly Phe Ala Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala 
405 410 415 

Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr 
420 425 430 

Asn He Pro Gin Thr Asp Glu Ser Thr Leu Met Thr He Ala Tyr Val 
435 440 445 

Met Ala Ala He Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val 
450 455 460 

Cys Gin Trp Arg Cys Leu Arg Cys Leu Arg Gin Gin His Asp Asp Phe 
465 470 475 480 

Ala Asp Asp He Ser Leu Leu Lys 
485 



<210> 3 
<211> 503 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Pro -memaps in 2 

<220> 

<223> Amino Acids 1-15 are vector-derived residues 
<220> 

<223> Amino Acids 16-64 are a putative pro peptide 
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<220> 

<223> Amino Acids 1-13 are the T7 promoter 



<220> 

<223> Amino Acids 16-456 are Pro-memapsin 2-T1 
<220> 

<223> Amino Acids 16-421 are Promemapsin 2-T2 
<400> 3 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg Gly Ser Met Ala 
1 5 10 15 

Gly Val Leu Pro Ala His Gly Thr Gin His Gly lie Arg Leu Pro Leu 
20 25 30 

Arg Ser Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu 
35 40 45 

Thr Asp Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu 
50 55 60 

Met Val Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu 
65 70 75 80 

Met Thr Val Gly Ser Pro Pro Gin Thr Leu Asn He Leu Val Asp Thr 
85 90 95 

Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His 
100 105 no 

Arg Tyr Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys 
115 120 125 

Gly Val Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly 
130 135 140 

Thr Asp Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val Arg Ala 
145 150 155 160 

Asn He Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser 
165 170 175 

Asn Trp Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Pro 
180 185 190 

Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His 
195 200 205 
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Val Pro Asn Leu Phe 
210 

Asn Gin Ser Glu Val 
225 

Gly lie Asp His Ser 
245 

Arg Arg Glu Trp Tyr 
260 

Gly Gin Asp Leu Lys 
275 

lie Val Asp Ser Gly 
290 

Glu Ala Ala Val Lys 
305 

Pro Asp Gly Phe Trp 
325 

Thr Thr Pro Trp Asn 
340 

Glu Val Thr Asn Gin 
355 

Leu Arg Pro Val Glu 
370 

Phe Ala lie Ser Gin 
385 

Met Glu Gly Phe Tyr 
405 

Phe Ala Val Ser Ala 
420 

Val Glu Gly Pro Phe 
435 

He Pro Gin Thr Asp 
450 



Ser Leu Gin Leu Cys Gly 
215 

Leu Ala Ser Val Gly Gly 
230 235 

Leu Tyr Thr Gly Ser Leu 
250 

Tyr Glu Val He He Val 
265 

Met Asp Cys Lys Glu Tyr 
280 

Thr Thr Asn Leu Arg Leu 
295 

Ser He Lys Ala Ala Ser 
310 315 

Leu Gly Glu Gin Leu Val 
330 

He Phe Pro Val He Ser 
345 

Ser Phe Arg He Thr He 
360 

Asp Val Ala Thr Ser Gin 
375 

Ser Ser Thr Gly Thr Val 
390 395 

Val Val Phe Asp Arg Ala 
410 

Cys His Val His Asp Glu 
425 

Val Thr Leu Asp Met Glu 
440 

Glu Ser Thr Leu Met Thr 
455 



Ala Gly Phe Pro Leu 
220 

Ser Met He He Gly 
240 

Trp Tyr Thr Pro He 
255 

Arg Val Glu He Asn 
270 

Asn Tyr Asp Lys Ser 
285 

Pro Lys Lys Val Phe 
300 

Ser Thr Glu Lys Phe 
320 

Cys Trp Gin Ala Gly 
335 

Leu Tyr Leu Met Gly 
350 

Leu Pro Gin Gin Tyr 
365 

Asp Asp Cys Tyr Lys 
380 

Met Gly Ala Val He 
400 

Arg Lys Arg He Gly 
415 

Phe Arg Thr Ala Ala 
430 

Asp Cys Gly Tyr Asn 
445 

He Ala Tyr Val Met 
460 
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Ala Ala lie Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val Cys 
465 470 475 480 



Gin Trp Arg Cys Leu Arg Cys Leu Arg Gin Gin His Asp Asp Phe Ala 
485 490 495 

Asp Asp lie Ser Leu Leu Lys 
500 

<210> 4 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 4 

Ser Glu Val Lys Met Asp Ala Glu Phe Arg 
15 10 

<210> 5 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 5 

Ser Glu Val Asn Leu Asp Ala Glu Phe Arg 
15 10 

<210> 6 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 6 



8 



Ser Val Asn Met Ala Glu Gly Asp 
1 5 



<210> 7 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 7 

Lys Gly Gly Val Val lie Ala Thr Val lie Val Lys 
15 10 

<210> 8 
<211> 4 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Asp Thr Ser Gly 
1 



<210> 9 
<211> 8 
<212> PRT 

<213> Homo sapiens 
<400> 9 

Leu Val Asn Met Ala Glu Gly Asp 

1 5 

<210> 10 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 10 

ggtaagcatc ccccatggcc ccaacgtc 



9 



<210> 11 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 11 

gacgttgggg ccatggggga tgcttacc 

<210> 12 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 12 

acgttgtctt tgatcgggcc cgaaaacgaa ttgg 

<210> 13 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 13 

ccaattcgtt ttcgggcccg atcaaagaca acg 

<210> 14 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 14 

ccatcctaat acgactcact atagggc 
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<210> 15 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 15 

actcactata gggctcgagc ggc 



<210> 16 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 16 

cttttgagca agttcagcct ggttaa 



<210> 17 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 17 

gaggtggctt atgagtattt cttccagggt a 31 



<210> 18 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

tggcgacgac tcctggagcc eg 
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<210> 19 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 19 

tgacaccaga ccaactggta atgg 24 

<210> 20 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 20 

catatggcgg gagtgctgcc tgcccac 27 

<210> 21 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 21 

ggatcctcac ttcagcaggg agatgtcatc agcaaagt 3 8 

<210> 22 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oxidized 
Insulin B-chain 

<220> 
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<223> Xaa at site 3 represents cysteic acid 



<400> 22 

His Leu Xaa Gly Ser His Leu Val 
1 5 

<210> 23 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oxidized 
Insulin B-chain 

<220> 

<223> Xaa at site 1 represents cysteic acid 
<400> 23 

Xaa Gly Glu Arg Gly Phe Phe Tyr 
1 5 

<210> 24 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 24 

Val Gly Ser Gly Val 
1 5 

<210> 25 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 25 
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Val Gly Ser Gly Val Leu Leu 
1 5 



<210> 26 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 26 

Gly Val Leu Leu Ser Arg Lys 
1 5 

<210> 27 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inhibitors 
<400> 27 

Val Asn Leu Ala Ala Glu Phe 
1 5 

<210> 28 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inhibitors 
<400> 28 

Glu Val Asn Leu Ala Ala Glu Phe 
1 5 

<210> 29 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 29 
Asn Leu Ala Ala 
1 



<210> 30 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 30 

Val Gly Ser Gly Val Leu Leu Ser Arg Lys 
15 10 



<210> 31 
<211> 326 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Amino acids 2-5, 6-9, 13-20, 25-32, 65-67, 69-74, 
79-87, 89-91, 99-106, 119-122, 150-154, 164-167, 
180-183, 191-194, 196-199, 201-204, 210-214, 
221-223, 258-262, 265-269, and 275-278 are Beta 
Strands 

<220> 

<223> Amino acids 281-284, 286-288, 298-301, 310-315, 
and 319-324 are Beta strands 

<220> 

<223> Amino acids 48-51, 111-114, 136-142, 225-234, 
249-254, 271-274, and 303-306 are Helices 

<220> 

<223> Amino acids 12-13, 30, 32, 34-35, 73-77, 111, 117, 
120, 189, 213, 215, 217-220, 287, 289, 291, 298, 
and 300 are residues in contact with pepstatin. 
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<220> 

<223> Pepsin 



<400> 31 

Val Asp Glu Gin Pro Leu Glu Asn Tyr Leu Asp Met Glu Tyr Phe Gly 
1 5 io i 5 

Thr lie Gly He Gly Thr Pro Ala Gin Asp Phe Thr Val Val Phe Asp 
20 25 30 

Thr Gly Ser Ser Asn Leu Trp Val Pro Ser Val Tyr Cys Ser Ser Leu 
35 40 45 

Ala Cys Thr Asn His Asn Arg Phe Asn Pro Glu Asp Ser Ser Thr Tyr 
50 55 60 

Gin Ser Thr Ser Glu Thr Val Ser He Thr Tyr Gly Thr Gly Ser Met 
65 70 75 80 

Thr Gly He Leu Gly Tyr Asp Thr Val Gin Val Gly Gly He Ser Asp 
85 90 95 

Thr Asn Gin He Phe Gly Leu Ser Glu Thr Glu Pro Gly Ser Phe Leu 
100 105 no 

Tyr Tyr Ala Pro Phe Asp Gly He Leu Gly Leu Ala Tyr Pro Ser He 
115 120 125 

Ser Ser Ser Gly Ala Thr Pro Val Phe Asp Asn He Trp Asn Gin Gly 
130 135 140 

Leu Val Ser Gin Asp Leu Phe Ser Val Tyr Leu Ser Ala Asp Asp Gin 
145 150 155 160 

Ser Gly Ser Val Val He Phe Gly Gly He Asp Ser Ser Tyr Tyr Thr 
165 170 175 

Gly Ser Leu Asn Trp Val Pro Val Thr Val Glu Gly Tyr Trp Gin He 
180 185 190 

Thr Val Asp Ser He Thr Met Asn Gly Glu Ala He Ala Cys Ala Glu 
195 200 205 

Gly Cys Gin Ala He Val Asp Thr Gly Thr Ser Leu Leu Thr Gly Pro 
210 215 220 

Thr Ser Pro He Ala Asn He Gin Ser Asp He Gly Ala Ser Glu Asn 
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225 



230 



235 



240 



Ser Asp Gly Asp Met Val Val Ser 
245 

Asp lie Val Phe Thr He Asn Gly 
260 

Ala Tyr He Leu Gin Ser Glu Gly 
275 280 

Met Asn Leu Pro Thr Glu Ser Gly 
290 295 

Phe He Arg Gin Tyr Phe Thr Val 
305 310 

Gly Leu Ala Pro Val Ala 
325 



Cys Ser Ala He Ser Ser Leu Pro 
250 255 

Val Gin Tyr Pro Val Pro Pro Ser 
265 270 

Ser Cys He Ser Gly Phe Gin Gly 
285 

Glu Leu Trp He Leu Gly Asp Val 
300 

Phe Asp Arg Ala Asn Asn Gin Val 
315 320 
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activity of presenilin therefore enhances the progression of Alzheimer's disease. 
This is supported by the observation that in the absence of presenilin gene, the 
production of alpha-beta42 peptide is lowered (De Strooper et aL, Nature 391, 
387 (1998)). Since unprocessed presenilin is degraded quickly, the processed, 

5 heterodimeric presenilin must be responsible for the accumulation of alpha- 
beta42 leading to Alzheimer's disease. The processing of presenilin by 
memapsin 2 would enhance the production of alpha-beta42 and therefore, 
further the progress of Alzheimer's disease. Therefore a memapsin 2 inhibitor 
that crosses the blood brain barrier can be used to decrease the likelihood of 

10 developing or slow the progression of Alzheimer's disease which is mediated by 
deposition of alpha-beta42. Since memapsin 2 cleaves APP at the beta cleavage 
site, prevention of APP cleavage at the beta cleavage site will prevent the build 
up of alpha-beta42. 
Vaccines 

1 5 The catalytically active memapsin 2 or fragments thereof including the 

active site defined by the presence of two catalytic aspartic residues and 
substrate binding cleft can be used to induce an immune response to the 
memapsin 2. The memapsin 2 is administered in an amount effective to elicit 
blocking antibodies, i.e., antibodies which prevent cleavage of the naturally 
20 occurring substrate of memapsin 2 in the brain. An unmodified vaccine may be 
useful in the prevention and treatment of Alzheimer's disease. The response to 
the vaccine may be influenced by its composition, such as inclusion of an 
adjuvant, viral proteins from production of the recombinant enzyme, and/or 
mode of administration (amount, site of administration, frequency of 
25 administration, etc). Since it is clear that the enzyme must be properly folded in 
order to be active, antibody should be elicited that is active against the 
endogenous memapsin 2. Antibodies that are effective against the endogenous 
enzyme are less likely to be produced against the enzyme that is not properly 
refolded. 



12M728 



25 



OMR1 1X2 



Pharmaceutical^ Acceptable Carriers 

The inhibitors will typically be administered orally or by injection. Oral 
administration is preferred. Alternatively, other formulations can be used for 
delivery by pulmonary, mucosal or transdermal routes. The inhibitor will 
5 usually be administered in combination with a pharmaceutically acceptable 
carrier. Pharmaceutical carriers are known to those skilled in the art. The 
appropriate carrier will typically be selected based on the mode of 
administration. Pharmaceutical compositions may also include one or more active 
ingredients such as antimicrobial agents, antiinflammatory agents, and analgesics. 
10 Preparations for parenteral administration or administration by injection 

include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. 
Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic esters such as ethyl 
oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions 
1 5 or suspensions, including saline and buffered media. Preferred parenteral 
vehicles include sodium chloride solution, Ringer's dextrose, dextrose and 
sodium chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include 
fluid and nutrient replenishes, and electrolyte replenishers (such as those based 
on Ringer's dextrose). 
20 Formulations for topical (including application to a mucosal surface, 

including the mouth, pulmonary, nasal, vaginal or rectal) administration may 
include ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and 
powders. Formulations for these applications are known. For example, a number 
of pulmonary formulations have been developed, typically using spray drying to 
25 formulate a powder having particles with an aerodynanmic diameter of between 
one and three microns, consisting of drug or drug in combination with polymer 
and/or surfactant 

Compositions for oral administration include powders or granules, 
suspensions or solutions in water or non-aqueous media, capsules, sachets, or 
30 tablets. Thickeners, flavorings, diluents, emulsifiers, dispersing aids or binders 
may be desirable. 
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Peptides as described herein can also be administered as a 
pharmaceutical^ acceptable acid- or base- addition salt, formed by reaction with 
inorganic acids such as hydrochloric acid, hydrobromic acid, perchloric acid, 
nitric acid, thiocyanic acid, sulfuric acid, and phosphoric acid, and organic acids 
such as formic acid, acetic acid, propionic acid, glycolic acid, lactic acid, 
pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, and fomaric 
acid, or by reaction with an inorganic base such as sodium hydroxide, 
ammonium hydroxide, potassium hydroxide, and organic bases such as mono-, 
di-, trialkyl and aryl amines and substituted ethanolamines. 

Dosages 

Dosing is dependent on severity and responsiveness of the condition to be 
treated, but will normally be one or more doses per day, with course of treatment 
lasting from several days to several months or until the attending physician 
determines no further benefit will be obtained. Persons of ordinary skill can 
determine optimum dosages, dosing methodologies and repetition rates. 

The dosage ranges are those large enough to produce the desired effect in 
which the symptoms of the memapsin 2 mediated disorder are alleviated 
(typically characterized by a decrease in size and/or number of amyloid plaque, 
or by a failure to increase in size or quantity), or in which cleavage of the alpha- 
beta42 peptide is decreased. The dosage can be adjusted by the individual 
physician in the event of any counterindications. 

The present invention will be further understood by reference to the 
following non-limiting examples. 
Example 1. Cloning of memapsin 2. 

L Cloning and nucleotide sequence of pro-memapsin 2. 

New sequences homologous to human aspartic proteases were found in 
the following entries in the EST IMAGE database: AA1 36368 pregnant uterus 
ATCC 947471, AA207232 neurepithelium ATCC 214526, and R55398 human 
breast ATCC 392689. The corresponding bacterial strains: #947471, #214526, 
and # 392689 containing the EST sequences were obtained from the ATCC 
(Rockville, MD). The sequencing of these clones obtained from ATCC 
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confirmed that they contained sequences not identical to known human aspartic 
proteases. The completed sequences of these clones assembled into about 80% 
of prepro-M2 cDNA. Full length cDNAs of these clones were obtained using 
the following methods. 

The Human Pancreas Marathon-Ready cDNA (Clontech), which is 
double-strand cDNA obtained by reverse-transcription, primer addition, and 
second strand synthesize of mRNA from human tissues, was used as template 
for PCR amplification. An adapter primer (API) and a nested adapter primer 
(AP2) were used for 5'- and 3'-RACE PCR. For PCR the 5'-region of the 
memapsin 2 cDNA, primers API and NHASPR1 were used. Primers for the 3'- 
end of the cDNA are NHASPF2 and API. The middle of the cDNA was 
amplified by primers NHASPF1 and NHASPR2. The sequence for the primers 
is as follows: NHASPF1: GGTAAGCATCCCCCATGGCCCCAACGTC (SEQ 
ID NO: 10), 

NHASPR1 : GACGTTGGGGCCATGGGGGATGCTTACC (SEQ ID NO: 1 1 ), 
NHASPF2: ACGTTGTCTTTGATCGGGCCCGAAAACGAATTGG (SEQ ID 
NO: 12), 

NHASPR2: CCAATTCGTTTTCGGGCCCGATCAAAGACAACG (SEQ ID 
NO: 13), 

API : CCATCCTAATACGACTCACTATAGGGC (SEQ ID NO: 14), and 
AP2: ACTCACTATAGGGCTCGAGCGGC (SEQ ID NO: 15) 

Memapsin 2 was also cloned from a human pancreas library (Quick- 
Screen Human cDNA Library Panel) contained in lambda-gtlO and lambda-gtl 1 
vectors. The primers from the vectors, GT10FWD, GT10REV, GT1 1FWD, and 
GT1 1REV, were used as outside primers. The sequence of the primers used was: 
GT10FWD: CTTTTGAGCAAGTTCAGCCTGGTTAA (SEQ ID NO: 16), 
GT10REV: GAGGTGGCTTATGAGTATTTCTTCCAGGGTA (SEQ ID 
NO: 17), 

GT11FWD: TGGCGACGACTCCTGGAGCCCG (SEQ ID NO: 18), 
GT1 1 REV: TGACACCAGACCAACTGGTAATGG (SEQ ID NO: 19). 
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In addition, memapsin 2 cDNA was amplified directly from the human 
pancreatic lambda-gtlO and lambda-gtl 1 libraries. The sequence of the primers 
was: PASPN1: catatgGCGGGAGTGCTGCCTGCCCAC (SEQ ID NO:20) 
and 

5 NHASPC1 : ggatccTCACTTCAGCAGGGAGATGTCATCAGCAAAGT (SEQ 
IDNO:21). 

The amplified memapsin 2 fragments were cloned into an intermediate 
PCR vector (Invitrogen) and sequenced. 

The assembled cDNA from the fragments, the nucleotide and the 
10 deduced protein sequence are shown in SEQ ID NO 1 and SEQ ID NO 2. 

Pro-memapsin 2 is homologous to other human aspartiaproteases. 
Based on the alignments, Pro-memapsin 2 contains a pro region, an aspartic 
protease region, and a trans-membrane region near the C-terminus. The active 
enzyme is memapsin 2 and its pro-enzyme is pro-memapsin 2. 
15 Example 2. Distribution of memapsin 2 in human tissues. 

Multiple tissue cDNA panels from Clontech were used as templates for 
PCR amplification of a 0.82 kb fragment of memapsin 2 cDNA. The primers 
used for memapsin 2 were NHASPF1 and NHASPR2. Tissues that contain 
memapsin 2 or fragments of memapsin 2 yielded amplified PCR products. The 
20 amount of amplified product indicated that memapsin 2 is present in the 

following organs from most abundant to least abundant: pancreas, brain, lung, 
kidney, liver, placenta, and heart. Memapsin 2 is also present in spleen, 
prostate, testis, ovary, small intestine, and colon cells. 

Example 3. Expression of pro-memapsin 2 cDNA in E. coli, refolding and 
25 purification of pro-memapsin 2. 

The pro-memapsin 2 was PCR amplified and cloned into the BamHl site 
of a pETl la vector. The resulting vector expresses pro-memapsin 2 having a 
sequence from Ala-8p to Ala 326. Figure 1 shows the construction of two 
expression vectors, pETl 1 -memapsin 2-T1 (hereafter Tl) and pETl 1 -memapsin 
30 2-T2 (hereafter T2). In both vectors, the N-terminal 1 5 residues of the 

expressed recombinant proteins are derived from the expression vector. Pro- 
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memapsin 2 residues start at residue Ala-16. The two recombinant pro- 
memapsin 2s have different C-terminal lengths. Clone Tl ends at Thr- 454 and 
clone T2 ends at AIa-419. The Tl construct contains a C-terminal extension 
from the T2 construct but does not express any of the predicted transmembrane 
5 domain. 

Expression of recombinant proteins and recovery of inclusion bodies 
The Tl and T2 expression vectors were separately transfected into E. 
coli strain BL21(DE3). The procedures for the culture of transfected bacteria, 
induction for synthesis of recombinant proteins and the recovery and washing of 
10 inclusion bodies containing recombinant proteins are essentially as previously 
described (Lin et al., 1 994). 

Three different refolding methods have produced satisfactory results, 

(i) The rapid dilution method. 

Pro-memapsin 2 in 8 M urea/100 mM beta-mercaptoethanol with 
15 OD 2 80nm = 5 was rapidly diluted into 20 volumes of 20 mM-Tris, pH 9.0. The 
solution was slowly adjusted into pH 8 with 1 M HCL The refolding solution 
was then kept at 4° C for 24 to 48 hours before proceeding with purification. 

(ii) The reverse dialysis method 

An equal volume of 20 mM Tris, 0.5 mM oxidized/1.25 mM reduced 
20 glutathione, pH 9.0 is added to rapidly stirred pro-memapsin 2 in 8 M urea/10 
mM beta-mercaptoethanol with OD 28 o nm = 5. The process is repeated three 
more times with 1 hour intervals. The resulting solution is then diaiyzed against 
sufficient volume of 20 mM Tris base so that the final urea concentration is 0.4 
M. The pH of the solution is then slowly adjusted to 8.0 with 1 M HCL 
25 iil The preferred method for refolding. 

Inclusion bodies are dissolved in 8 M urea, 0.1 M Tris, 1 mM Glycine, 1 
mM EDTA, 100 mM beta-mercaptoethanol, pH 10.0. The OD 28 o of the 
inclusion bodies are adjusted to 5.0 with the 8 M urea solution without beta- 
mercaptoethanol. The final solution contains the following reducing reagents: 
30 10 mM beta-mercaptoethanol, 10 mM DTT (Dithiothreitol), 1 mM reduced 

glutathion, and 0.1 M oxidized glulathion. The final pH of the solution is 10.0. 
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The above solution is rapidly diluted into 20 volumes of 20 mM Tris 
base, the pH is adjusted to 9.0, and the resulting solution is kept at 4 °C for 16 
hr. The solution is equilibrated to room temperature in 6 hr, and the pH is 
adjusted to 8.5. The solution is returned to 4 °C again for 18 hr. 
5 The solution is again equilibrated to room temperature in 6 hr, and the 

pH is adjusted to 8.0. The solution is returned to 4 °C again for 4 to 7 days. 

The refolding procedures are critical to obtain an enzymically active 
preparation which can be used for studies of subsite specificity of M2, to 
analyze inhibition potency of M2 inhibitors, to screen for inhibitors using either 
10 random structural libraries or existing collections of compound libraries, to 
produce crystals for crystallography studies of M2 structures, and to produce 
monoclonal or polyclonal antibodies of M2. 

Purification of recombinant pro-memapsin 2-T2 
The refolded material is concentrated by ultrafiltration, and separated on 
1 5 a SEPHACRYL™ S-300 column equilibrated with 20 mM Tris.HCl, 0.4 M 
urea, pH 8.0. The refolded peak (second peak) from the S-300 column can be 
further purified with a FPLC RESOURCE-Q™ column, which is equilibrated 
with 20 mM Tris-HCl, 0.4 M urea, pH 8,0. The enzyme is eluted from the 
column with a linear gradient of NaCL The refolded peak from S-300 can also 
20 be activated before further purification. For activation, the fractions are mixed 
with equal volume 0.2 M Sodium Acetate, 70% glycerol, pH 4.0. The mixture is 
incubated at 22 °C for 18 hr, and then dialyzed twice against 20 volumes of 20 
mM Bis-Tris, 0.4 M urea, pH 6.0. The dialyzed materials are then further 
purified on a FPLC RESOURCE-Q™ column equilibrated with 20 Bis-Tris, 0.4 
25 M urea, pH 6.0. The enzyme is eluted with a linear gradient of NaCL 

SDS-PAGE analysis of the S-300 fractions under reduced and non- 
reduced conditions indicated that Pro-memapsin 2 first elutes as a very high 
molecular weight band (greater than about 42 kD) under non-reduced 
conditions- This indicates that the protein is not folded properly in these 
30 fractions, due to disulfide cross linking of proteins. Subsequent fractions 

contain a protein of predicted pro-memapsin 2-T2 size (about 42 kDa). The pro- 
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enzyme obtained in these fractions is also proteolytically active for auto- 
catalyzed activation. These fractions were pooled and subjected to 
chromatography on the FPLC RESOURCE™ column eluted with a linear 
gradient of NaCL Some fractions were analyzed using SDS-PAGE under non- 
5 reducing conditions. The analysis showed that fractions 6 and 7 contained most 
of the active proteins, which was consistent with the first FPLC peak containing 
the active protein. The main peak was coupled to a shoulder peak, and was 
present with repeated purification with the same RESOURCE™ Q column. The 
main shoulder peaks were identified as active pro-memapsin 2 that exist in 
1 0 different conformations under these conditions. 

Example 4, Proteolytic activity and cleavage-site preferences of 
recombinant memapsin 2, 

The amino acid sequence around the proteolytic cleavage sites was 
determined in order to establish the specificity of memapsin 2. Recombinant 
15 pro-memapsin 2-T1 was incubated in 0.1 M sodium acetate, pH 4.0, for 16 
hours at room temperature in order to create autocatalyzed cleavages. The 
products were analyzed using SDS-polyacrylamide gel electrophoresis. Several 
bands which corresponded to molecular weights smaller than that of pro- 
memapsin 2 were observed. The electrophoretic bands were trans-blotted onto a 
20 PVDF membrane. Four bands were chosen and subjected to N-terminal 

sequence determination in a Protein Sequencer. The N-terminal sequence of 
these bands established the positions of proteolytic cleavage sites on pro- 
memapsin 2. 

In addition, the oxidized B-chain of bovine insulin and two different 
25 synthetic peptides were used as substrates for memapsin 2 to determine the 
extent of other hydrolysis sites. These reactions were carried out by auto- 
activated pro-memapsin 2 in 0.1 M sodium acetate, pH 4.0, which was then 
incubated with the peptides. The hydrolytic products were subjected to HPLC 
on a reversed phase C-18 column and the eluent peaks were subjected to 
30 electrospray mass spectrometry for the determination of the molecular weight of 
the fragments. Two hydrolytic sites were identified on oxidized insulin B-chain 
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(Tablel). Three hydrolytic sites were identified from peptide NCH-gamma. A 
single cleavage site was observed in synthetic peptide PS 1 -gamma, whose 
sequence (LVNMAEGD) (SEQ ID NO:9) is derived from the beta-processing 
site of human presenilin 1 (Table 1). 
Table 1: Substrate Specificity of Memapsin 2 



Site# 


Substrate 


P4 


P3 


P2 


PI 


PI* 


P2' 


P3' 


P4' 




1 


Pro- 

memapsin 2 


R 


G 


S 


M 


A 


G 


V 


L 


aa 12-18 of SEQ ID No.3 


2 


G 


T 


Q 


H 


G 


I 


R 


L 


aa 23-30 of SEQ ID No.3 


3 


S 


S 


N 


F 


A 


V 


G 


A 


aa 98-105 of SEQ ID No.3 


4 


G 


L 


A 


Y 


A 


E 


I 


A 


aa 183-190 of SEQ ID No.3 


5 


Oxidized 
insulin B- 
chain 1 


H 


L 


C A 


G 


S 


H 


L 


V 


C A is cysteic acid; 
SEQ ID No. 22 
SEQ ID No. 23 


6 


C A 


G 


E 


R 


G 


F 


F 


Y 


7 


Synthetic 
peptide* 








V 


G 


S 


G 


V 


Three sites cleaved in a peptide: 
VGSGVLLSRK (SEQ ID 
NO:30) 

SEQ ID No. 24 

SEQ ID No. 25 

SEQ ID No. 26 


8 




V 


G 


s 


G 


V 


L 


L 


9 


G 


V 


L 


L 


S 


R 


K 




10 


Peptide** 


L 


V 


N 


M 


A 


E 


G 


D 


SEQ ID No. 9 



Example 5. Activation of pro-memapsin 2 and enzyme kinetics. 

Incubation in 0.1 M sodium acetate, pH 4.0, for 16 h at 22°C auto- 
catalytically converted pro-Mi^ to M2 p <j. For initial hydrolysis tests, two 
10 synthetic peptides were separately incubated with /?ro-M2 pd in 0. 1 MNa acetate, 
pH 4.0 for different periods ranging from 2 to 18 h. The incubated samples 
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were subjected to LC/MS for the identification of the hydrolytic products. For 
kinetic studies, the identified HPLC (Beckman System Gold) product peaks 
were integrated for quantitation. The Km and kcat values for presenilin 1 and 
Swedish APP peptides (Table 1) were measured by steady-state kinetics. The 
5 individual Km and kcat values for APP peptide could not be measured accurately 
by standard methods, so its kcat/Km value was measured by competitive 
hydrolysis of mixed substrates against presenilin 1 peptide (Fersht, A. "Enzyme 
Structure and Mechanism", 2 nd Ed., W.H. Freeman and Company, New York. 
(1985)). 

10 The results are shown in Figures 2A and 2B. The conversion of pro- 

M2 p d at pH 4.0 to smaller fragments was shown by SDS-polyacrylamide 
electrophoresis. The difference in migration between pro-M2^ and converted 
enzyme is evident in a mixture of the two. Figure 2A is a graph of the initial 
rate of hydrolysis of synthetic peptide swAPP (see Table 1) by M2 p(J at different 

1 5 pH, Figure 2B is a graph of the relative kcat/Km values for steady-state kinetic of 
hydrolysis of peptide substrates by M2 pc j. 
Example 6, Expression in Mammalian cells. 
Methods 

PM2 cDNA was cloned into the EcoRV site of vector pSecTag A 
20 (Invitrogen). Human APP cDNA was PCR amplified from human placenta 8- 
gtl 1 library (Clontech) and cloned into the Nhel and Xbal sites of pSecTag A. 
The procedure for transfection into HeLa cells and vaccinia virus infection for 
T7-based expression are essentially the same as described by Lin, X., FASEB J. 
7:1070-1080(1993). 
25 Transfected cells were metabolically labeled with 200 microCi 35 S 

methionine and cysteine (TransLabel; ICN) in 0.5 ml of serum-free/methionine- 
free media for 30 min, rinsed with 1 ml media, and replaced with 2 ml 
DMEM/10% FCS. In order to block vesicle acidification, Bafilomycin Al was 
included in the media (Perez, R.G., el aL, J Biol Chem 271 :9 100-9 107 (1996)). 
30 At different time points (chase), media was removed and the" cells were 

harvested and lysed in 50 mM Tris, 0.3 M NaCl, 5 mM EDTA, 1% Triton X- 
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100, pH 7.4, containing 10 mM iodoacetamide, 10 :M TPCK, 10 :M TLCK, and 
2 microg/ml leupeptin. The supernatant (14,000 x g) of cell lysates and media 
were immunoadsorbed onto antibody bound to protein G sepharose (Sigma). 
Anti- APP N-terminal domain antibody (Chemicon) was used to recover the 
betaN-fragment of APP and anti-alpha-beta j.n antibody (Chemicon, 
recognizing the N-terminal 17 residues of alpha-beta) was used to recover the 12 
kDa B C-fragment The former antibody recognized only denatured protein, so 
media was first incubated in 2 mM dithiothrietol 0.1% SDS at 55°C for 30 min 
before immunoabsorption. Samples were cooled and diluted with an equal 
volume of cell lysis buffer before addition of anti- APP N-terminal domain 
(Chemicon). Beads were washed, eluted with loading buffer, subjected to SDS- 
P AGE (NOVEX™) and visualized by autoradiogram or phosphorimaging 
(Molecular Dynamics) on gels enhanced with Amplify (Amersham). 
Immunodetection of the betaN-fragment was accomplished by transblotting onto 
a PVDF membrane and detecting with anti-alpha-betaun and chemiluminescent 
substrate (Amersham). 
Results . 

HeLa cells transfected with APP or M2 in 4-well chamber slides were 
fixed with acetone for 10 min and permeabilized in 0.2% Triton X-100 in PBS 
for 6 min. For localizing M2, polyclonal goat anti-/7ra-M2 pd antibodies were 
purified on DEAE-sepharose 6B and affinity purified against recombinant pro- 
M2 pd immobilized on Affigel (BioRad). Purified anti-/?ra-M2 p(1 antibodies were 
conjugated to Alexa56€ (Molecular Probes) according to the manufacturer's 
protocol. Fixed cells were incubated overnight with a 1 :100 dilution of antibody 
in PBS containing 0.1% BSA and washed 4 times with PBS. For APP, two 
antibodies were used. Antibody A 6 i-n (described above) and antibody AIJ n_42> 
which recognizes the first 26 residues following the beta-secretase cleavage site 
(Chemicon). After 4 PBS washes, the cells were incubated overnight with an 
anti-mouse FITC conjugate at a dilution of 1 :200. Cells were mounted in 
Prolong anti-fade reagent (Molecular Probes) and visualized on a Leica TCS 
confocal laser scanning microscope. 
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Example 7: Design and Synthesis of OM99-1 and OM99-2. 

Based on the results of specificity studies of memapsin 2, it was 
predicted that good residues for positions PI and PI' would be Leu and Ala. It 
was subsequently determined from the specificity data that PI' preferred small 
5 residues, such as Ala and Sen However, the crystal structure (determined below 
in Example 9) indicates that this site can accommodate a lot of larger residues. 
It was demonstrated that P V of memapsin 2 is the position with the most 
stringent specificity requirement where residues of small side chains, such as 
Ala, Ser, and Asp, are preferred. Ala was selected for PI' mainly because its 

10 hydophobicity over Ser and Asp is favored for the penetration of the blood-brain 
barrier, a requirement for the design of a memapsin 2 inhibitor drug for treating 
Alzheimer's disease. Therefore, inhibitors were designed to place a transition- 
state analogue isostere between Leu and Ala (shown as Leu* Ala, where * 
represents the transition-state isostere, -CH(OH)-CH 2 -) and the subsite P4, P3, 

15 P2, P2\ P3 f and P4 1 are filled with the beta-secretase site sequence of the 
Swedish mutant from the beta-amyloid protein. The structures of inhibitors 
OM99-1 and OM99-2 are shown below and in Figures 3 A and 3B, respectively: 

OM99- 1 : Val-Asn-Leu* Ala-Ala-Glu-Phe (SEQ. ID NO. 27) 

20 OM99-2: Glu-Vai-Asn-Leu*Ala-Ala-Glu-Phe (SEQ. ID NO. 28) 

The Leu* Ala dipeptide isostere was synthesized as follows: 
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The Leu- Ala dipeptide isostere for the M 2 -inhibitor was prepared from 
L-leucine, As shown in Scheme 1, L-leucine was protected as its BOC- 
derivative 2 by treatment with BOC 2 0 in the presence of 10% NaOH in diethyl 
ether for 12 h. Boc-leucine 2 was then converted to Weinreb amide 3 by 
treatment with isobutyl chcloroformate and N-methylpiperidine followed by 
treatment of the resulting mixed anhydride with N,0-dimethylhydroxylamine 




/BuOCOCI 
N-methyl piperidine, 
MeONHMe.HCI, 97 

H 




(Scheme I) 

(Nahm and Weinreb, Tetrahedron Letters 1981, 32, 3815). Reduction of 3 with 
lithium aluminum hydride in diethyl ether provided the aldehyde 4. Reaction of 
the aldehyde 4 with lithium propiolate derived from the treatment of ethyl 
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propiolate and lithium diisopropylamide afforded the acetylenic alcohol 5 as an 
inseparable mixture of diastereomers (5.8:1) in 42% isolated yield (Fray, Kaye 
and Kleinman, J. Org. Chem. 1986, 51, 4828-33). Catalytic hydrogenation of 5 
over Pd/BaSCU followed by acid-catalyzed lactonization of the resulting 
gamma-hydroxy ester with a catalytic amount of acetic acid in toluene at reflux, 
furnished the gamma-lactone 6 and 7 in 73% yield. The isomers were separated 
by silica gel chromatography by using 40% ethyl acetate in hexane as the eiuent. 




LHMDS, Mel, 



THF 
70-76 % 




H 




Me 



1. Aq. LiOH 

2. TBDMSCI, 
imidazole, 

DMF 



— Si 



OH 



1. TFA : DCM 
(1:4), O^C 

Aq. NaHCO 3 , 
dioxane 



H 




OH 



O 



(Scheme 2) 

Introduction of the methyl group at C-2 was accomplished by stereoselective 
alkylation of 7 with methyl iodide (Scheme 2). Thus, generation of the dianion 

1 0 of lactone 7 with lithium hexamethyldisilazide (2.2 equivalents) in 

tetrahydrofuran at -78°C (30 min) and alkylation with methyl iodide (LI 
equivalenfs) for 30 min at -78°C, followed by quenching with propionic acid (5 
equivalents), provided the desired alkylated lactone 8 (76% yield) along with a 
small amount (less than 5%) of the corresponding epimer (Ghosh and Fidanze, 

15 1998 J, Org. Chem. 1998, 63, 6146-54). The epimeric cis-lactone was removed 
by column chromatography over silica gel using a mixture (3: 1) of ethyl acetate 
and hexane as the solvent system. The stereochemical assignment of alkylated 
lactone 8 was made based on extensive 1 H-NMR NOE experiments. Aqueous 
lithium hydroxide promoted hydrolysis of the lactone 8 followed by protection 

20 of the gamma-hydroxyl group with /er/-butyldimethylsilyl chloride in the 
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presence of imidazole and dimethylatninopyridine in dimethylformamide 
afforded the acid 9 in 90% yield after standard work-up and chromatography. 
Selective removal of the BOC-group was effected by treatment with 
trifluoroacetic acid in dichloromethane at 0°C for 1 h. The resulting amine salt 
5 was then reacted with commercial (Aldrich, Milwaukee) Fmoc-succinimide 
derivative in dioxane in the presence of aqueous NaHCC>3 to provide the Fmoc- 
protected L* A isostere 10 in 65% yield after chromatography. Protected 
isostere 10 was utilized in the preparation of a random sequence inhibitor 
library. 

1 0 Experimental procedure 

N-(tert-Butoxycarbonyl)-L-Leucine (2), 

To the suspension of 10 g (76.2 mmol ) of L-leucine in 140 mL of 
diethyl ether was added 80 mL of 10 % NaOH. After all solid dissolves, 20 mL 
(87.1 mmol) of BOC2O was added to the reaction mixture. The resulting 

15 reaction mixture was stirred at 23°C for 12 h. After this period, the layers were 
separated and the aqueous layer was acidified to pH 1 by careful addition of 1 N 
aqueous HC1 at 0 °C. The resulting mixture was extracted with ethyl acetate (3 x 
100 mL). The organic layers were combined and washed with brine and dried 
over anhydrous Na2S04. The solvent was removed under reduced pressure to 

20 provide title product which was used directly for next reaction without further 
purification (yield, 97 %). l H NMR (400 MHz, CDC1 3 ) 5 4.89 (broad d, 1H, J 
= 8.3 Hz), 4.31 (m, 1H), 1.74- 1.49 (m, 3H), 1.44 (s, 9H) 3 0.95 (d, 6H, J = 6.5 
Hz).- 

N~(tert-Butoxycarbonyl)-L-leucine-N '-methoxy-N '-methyla-mide (3). 

25 To a stirred solution of N,0-dimethylhydroxy amine hydrochloride (5.52 

g, 56.6 mmol) in dry dichloromethane (25 mL) under N2 atmosphere at 0°C, - 
methylpiperidine (6.9 mL, 56.6 mmol) was added dropwise. The resulting 
mixture was stirred at 0°C for 30 min. In a separate flask, N-(tert- 
butyloxycarbonyI)-L-leucine (1) (1 1.9 g, 51.4 mmol) was dissolved in a mixture 

30 of THF (45 mL) and dichloromethane (1 80 mL) under N 2 atmosphere. The 
resulting solution was cooled to — 20°C. To this solution was added 1- 



1251728 



OMRI" 182 
204S7/2S2 



methylpiperidine (6.9 mL, 56.6 mmol) followed by isobutyl chloroformate (7.3 
mL, 56.6 mmol). The resulting mixture was stirred for 5 minutes at -20°C and 
the above solution of N,0-dimethylhydroxyamine was added to it. The reaction 
mixure was kept -20 °C for 30 minutes and then warmed to 23°C. The reaction 
5 was quenched with water and the layers were seperated. The aqueous layer was 
extracted with dichloromethane (3 x 100 mL). The combined organic layers 
were washed with 10% citric acid, saturated sodium bicarbonate, and brine. The 
organic layer was dried over anhydrous Na 2 S0 4 and concentrated under the 
reduced pressure. The residue was purified by flash silica gel chromatography 

1 0 (25% ethyl acetate/hexane) to yield the title compound 3 ( 1 3 .8 g, 97%) as a pale 
yellow oil. l H NMR (400 MHz, CDC1 3 ) 8 5.06 (broad d, 1H, J. = 9.1 Hz), 4.70 
(m, 1H), 3.82 (s, 3H), 3.13 (s, 3H), 1.70 (m, 1H), 1.46-1.36 (m, 2H) 1.41 (s, 
9H), 0.93 (dd, 6H, J = 6.5, 14.2 Hz). 
N-( tert-Butoxycarbonyl) -L-leucinal (4 ). 

15 To a stirred suspension of lithium aluminum hydride (770 mg, 20.3 

mmol) in dry diethyl ether (60 mL) at -40 °C under N 2 atmosphere, was added 
N-tert-butyloxycarbonyl-L-leucine-N'-methoxy-N'-methylamide (5.05 g, 18.4 
mmol) in diethyl ether (20 mL). The resulting reaction mixture was stirred for 
30 min. After this period, the reaction was quenched with 10% NaHS0 4 

20 solution (30 mL). The resulting reaction mixture was then warmed to 23°C and 
stirred at that temperature for 30 min. The resulting solution was filtered and the 
filter cake was washed by two portions of diethyl ether. The combined organic 
layers were washed with saturated sodium bicarbonate, brine and dried over 
anhydrous MgS0 4 . Evaporation of the solvent under reduced pressure afforded 

25 the title aldehyde 4 (3 .4 1 g) as a pale yellow oil. The resulting aldehyde was 
used immediately without further purification. ! H NMR (400 MHz, CDC1 3 ) 5 
9.5 (s, 1H), 4.9 (s, 1H), 4.2 (broad m, 1H), 1.8-1.6 (m, 2H), 1.44 (s, 9H), 1.49- 
1.39 (m, 1H), 0.96 (dd, 6H, J = 2.7, 6.5 Hz). 

Ethyl (4S,5S)-and (4R,5S)-5-[(terl-Buloxycarbonyl)amino]-4-hydroxy-7- 
30 melhyloct-2-ynoate (5). 
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To a stirred solution of diisopropylamine (1.1 mL, 7.9 mmol) in dry THF 
(60 mL) at 0°C under N2 atmosphere, was added n-BuLi (1 .6 M in hexane, 4.95 
mL, 7.9 mmol) dropwise. The resulting solution was stirred at 0°C for 5 min 
and then warmed to 23 °C and stirred for 15 min. The mixture was cooled to - 

5 78°C and ethyl propiolate (801 jaL) in THF (2 mL) was added dropwise over a 
period of 5 min. The mixture was stirred for 30 min, after which N-Boc-L- 
leucinal 4 (1.55 g, 7.2 mmol) in 8 mL of dry THF was added. The resulting 
mixture was stirred at -78°C for 1 h. After this period, the reaction was 
quenched with acetic acid (5 mL) in THF (20 mL). The reaction mixure was 

10 warmed up to 23 °C and brine solution was added. The layers were separated 
and the organic layer was washed with saturated sodium bicarbonate and dried 
over Na2S04. Evaporation of the solvent under reduced pressure provided a 
residue which was purified by flash silica gel chromatography (15 % ethyl 
acetate / hexane) to afford a mixture (3:1) of acetylenic alcohols 5 (0.96 g, 42 

15 %). *H NMR (300 MHz, CDC1 3 ) 6 4.64 (d, 1H, J = 9.0 Hz), 4.44 (broad s, 1H), 
4.18 (m, 2H ), 3.76 (m, 1H), 1.63 (m, 1H), 1.43-1.31 (m, 2H), 1.39 (s, 9H), 1.29- 
1.18 (m,3H), 0.89 (m, 6H). 

(5SJ 'S)-5-[l -[(tert-Butoxycarbonyl)amino]-3'-methylbu{yl] 
-dihydrofuran-2(3H)-one (7). 

20 To a stirred solution of the above mixture of acetylenic alcohols (1 .73 g, 

5.5 mmol ) in ethyl acetate (20 mL) was added 5% Pd/BaS0 4 (1 g). The 
resulting mixture was hydrogenated at 50 psi for 1.5 h. After this period, the 
-catalyst was filtered off through a plug of Celite and the filtrate was 
concentrated under reduced pressure. The residue was dissolved in toluene (20 

25 mL) and acetic acid (100 pL). The reaction mixure was refluxed for 6 h. After 
this period, the reaction was cooled to 23 °C and the solvent was evaporated to 
give a residue which was purified by flash silica gel chromatography (40% 
diethyl ether / hexane) to yield the (5S, 1 S>gamma-lactone 7 (0.94 g, 62.8 and 
the (5R, lS>gamma-lactone 6 (0.16 g, 10.7 %). Lactone 7: 'H NMR (400 

30 MHz, CDCI3) 8 4.50-4.44 (m, 2H), 3.84-3.82 (m, 1H), 2.50 (1, 2H, J - 7.8 Hz), 
2.22-2.10 (m,2H), 1.64-1.31 (m, 3H), 1.41 (s,9H),0.91 (dd,6H, J =2.2,6.7 
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Hz); U C NMR (75 MHz, CDCl 3 ) 5 177.2, 156.0, 82.5, 79.8, 51.0, 42.2, 28,6, 
28.2, 24.7, 24.2, 23.0,21.9. 

(3R,5S,1 'S)-5-[l ''[(tert-Butoxycarbonytyamino)]^ '-methylbut-yl]-3-methyl 
dihydrofuran-2(3H)-one (8). 

5 To a stirred solution of the lactone 7 (451.8 mg, L67 mmol) in dry THF 

(8 mL) at -78°C under N2 atmosphere, was added lithium hexamethyldisilazane 
(3.67 mL, 1.0 M in THF) over a period of 3 min. The resulting mixture was 
stirred at -78°C for 30 min to generate the lithium enolate. After this period, 
Mel (228 pL) was added dropwise and the resulting mixture was stirred at - 

10 78°C for 20 min. The reaction was quenched with saturated aqueous NH4CI 
solution and was allowed to warm to 23°C. The reaction mixture was 
concentrated under reduced pressure and the residue was extracted with ethyl 
acetate (3 x 100 mL). The combined organic layers were washed with brine and 
dried over anhydrous NajSCU. Evaporation of the solvent afforded a residue 

1 5 which was purified by silica gel chromatography (1 5 % ethyl acetate / hexane) 
to furnish the alkylated lactone 8 (0.36 g, 76 %) as an amorphous solid. l H 
NMR (300 MHz, CDCI3) 5 4.43 (broad t, 1H, J = 6.3 Hz), 4.33 (d, 1H, J = 9.6 
Hz), 3.78 (m, 1H), 2.62 (m, 1H), 2.35 (m, 1H), 1.86 (m , 1H), 1.63-1.24 (m, 
3H), 1.37 (s, 9H), 1.21 (d, 3H, J = 7.5 Hz), 0.87 (dd, 6H, J = 2.6, 6.7 Hz); i3 C 

20 NMR (75 MHz, CDC1 3 ) 5 180.4, 156.0, 80.3, 79.8, 51.6, 41.9, 34.3, 32,5, 28.3, 
24.7, 23.0,21.8, 16.6. 

(2R t 4S, 5S)-5-[(tert-Butoxycarbonyl)amino ]-4-[(tert-butyldimeth 
' -yl$ilyl)oxy ]-2, 7-dimethyloctanoic acid (9). 

To a stirred solution of lactone 8 (0.33 g, 1.17 mmol ) in THF (2 mL) 
25 was added 1 N aqueous LiOH solution (5.8 mL). The resulting mixture was 

stirred at 23°C for 10 h. After this period, the reaction mixture was concentrated 
under reduced pressure and the remaining aqueous residue was cooled to 0°C 
and acidified with 25% citric acid solution to pH 4. The resulting acidic 
solution was extracted with ethyl acetate (3 x 50 mL). The combined organic 
30 layers were washed with brine, dried over Na2S04 and concentrated to yield the 
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corresponding hydroxy acid (330 mg) as a white foam. This hydroxy acid was 
used directly for the next reaction without further purification. 

To the above hydroxy acid (330 mg, 1.1 mmoi) in anhydrous DMF was 
added imidazole (1.59 g, 23.34 mmol) and tert-butyldimethylchlorosilane (1.76 
g, 1 1.67 mmol). The resulting mixture was stirred at 23°C for 24 h. After this 
period, MeOH (4 mL) was added and the mixture was stirred for 1 h. The 
mixure was diluted with 25% citric acid (20 mL) and was extracted with ethyl 
acetate (3 x 20 mL). The combined extracts were washed with water, brine and 
dried over anhydrous Na2S04. Evaporation of the solvent gave a viscous oil 
which was purified by flash chromatography over silica gel (35% ethyl acetate / 
hexane) to afford the silyl protected acid 9 (0.44 g, 90 %). IR (neat) 3300-3000 
(broad ), 2955, 2932, 2859, 171 1 cm" 1 ; ! H NMR (400 MHz, DMSO-d 6 , 343 K) 
delta 6.20 (broad s, 1 H), 3.68 (m, 1H), 3.51 (broad s, 1H), 2.49-2.42 (m, 1H), 
1.83 (t, 1H, J - 10.1 Hz), 1.56 (m, 1H), 1.37 (s, 9H), 1.28-1.12 (m, 3H), 1.08 (d, 
3H, J = 7.1 Hz), 0.87 (d, 3H, J = 6.1 Hz) 0.86 (s, 9 H), 0.82 (d, 3H, J = 6.5 Hz), 
0.084 (s, 3H), 0.052 (s, 3H). 

(2R, 4S,5S)~5-[ (fluorenylmethyloxycarbonyl) amino ]-4-[(tert-butyldi~rnethyl 
silyl)oxy]-2, 7-dimethyloctanoic acid (10). 

To a stirred solution of the acid 9 (0.17 g , 0.41 mmol) in 
dichloromethane (2 mL) at 0°C was added trifluoroacetic acid (500 jjL). The 
resulting mixture was stirred at 0°C for 1 h and an additional portion (500 pL) 
of trifluoroacetic acid was added to the reaction mixture. The mixture was 
stirred for an additional 30 min and the progress of the reaction was monitored 
by TLC. After this period, the solvents were carefully removed under reduced 
pressure at a bath temperature not exceeding 5°C. The residue was dissolved in 
dioxane (3 mL) and NaHC03 (300 mg) in 5 mL of H2O. To this solution was 
added Fmoc-succinimide (166.5 mg, 0.49 mmol) in 5 mL of dioxane. The 
resulting mixture was stirred at 23°C for 8 h. The mixure was then diluted with 
H2O (5 mL) and acidified with 25% aqueous citric acid to pH 4. The acidic 
solution was extracted with ethyl acetate (3 x 50 mL). The combined extracts 
were washed with brine, dried over Na2SCXi and concentrated under reduced 
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pressure to give a viscous oil residue. Purification of the residue by flash 
chromatography over silica gel afforded the Fmoc-protected acid 10 (137 mg, 
61%) as a white foam. ! H NMR (400 MHz, DMSO-d 6 , 343 K) 5 7.84 (d, 2H, J 
- 7.4 Hz), 7.66 (d, 2H, J = 8 Hz), 7.39 (t, 2H, J = 7.4 Hz), 7.29 (m, 2H), 6.8 (s, 
1H), 4.29-4.19 (m, 3H), 3.74-3,59 (m, 2H), 2.49 (m, 1H), 1.88 (m, 1H), 1.58 (m, 
1H), 131-1.17 (m, 3H), 1.10 (d, 3H, J = 7.1 Hz), 0.88 (s, 9H), 0.82 (d, 6H, J = 
6.2 Hz), 0.089 (s, 3 H), 0.057 (s, 3H). 

The synthesis of OM99-1 and OM99-2 were accomplished using solid 
state peptide synthesis procedure in which Leu* Ala was incorporated in the 
fourth step. The synthesized inhibitors were purified by reverse phase HPLC and 
their structure confirmed by mass spectrometry. 
Example 8. Inhibition of Memapsin 2 by OM99-1 and OM99-2. 

Enzyme activity was measured as described above, but with the addition 
of either OM99-1 or OM99-2. OM99-1 inhibited recombinant memapsin 2 as 
shown in Figure 5 A. The Ki calculated is 3 x 10' 8 M. The substrate used was a 
synthetic fluorogenic peptide substrate. The inhibition of OM99-2 on 
recombinant memapsin 2 was measured using the same fluorogenic substrate. 
The Ki value was determined to be 9.58 x 10" 9 M, as shown in Figure 5B. 

These results demonstrate that the predicted subsite specificity is 
accurate and that inhibitors can be designed based on the predicted specificity. 

The residues in P 1 and PT are very important since the M2 inhibitor 
must penetrate the blood-brain barrier (BBB), The choice of Ala in PT 
facilitates the penetration of BBB. Analogues of Ala side chains will also work. 
For example, in addition to the methyl side chain of Ala, substituted methyl 
groups and groups about the same size like methyl or ethyl groups can be 
substituted for the Ala side chain. Leu at PI can also be substituted by groups of 
similar sizes or with substitutions on Leu side chain. For penetrating the BBB, it 
is desirable to make the inhibitors smaller. One can therefore use OM99-1 as a 
starting point and discard the outside subsites P4, P3, P3 f and P4 1 . The retained 
structure Asn-Leu* Ala-Ala (SEQ ID NO:29) is then further evolved with 
substitutions for a tight-binding M2 inhibitor which can also penetrate the BBB. 
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Example 9. Crystallization and X-ray diffraction study of the protease 
domain of human memapsin 2 complexed to a specifically designed inhibitor, 
OM99-2. 

The crystallization condition and preliminary x-ray diffraction data on 
recombinant human memapsin 2 complexed to OM99-2 were determined. 
Production of Recombinant Memapsin 2 

About 50 mg of recombinant memapsin 2 was purified as described in 
Example 3. For optimal crystal growth, memapsin 2 must be highly purified, 
Memapsin 2 was over-expressed from vector pETl la-M2pd. This memapsin 2 
is the zymogen domain which includes the pro and catalytic domains to the end 
of the C-terminal extension but does not include the transmembrane and the 
intracellular domains. The vector was transfected into E. coli BL21 (DE3) and 
plated onto ZB agar containing 50 mg/liter ampicillin. A single colony was 
picked to inoculate 100 ml of liquid ZB containing 5 mg ampicillin and cultured 
at 30 °C, for 18 hours, with shaking at 220 RPM. Aliquots of approximately 15 
ml of the overnight culture were used to inoculate each 1 liter of LB containing 
50 mg of ampicillin. Cultures were grown at 37 °C, with shaking at 180 RPM, 
until an optical density at 600 nm near 0.8 was attained. At that time, expression 
was induced by addition of 1 19 mg of IPTG to each liter of culture. Incubation 
was continued for 3 additional hours post-induction. 

Bacteria were harvested, suspended in 50 mM Tris, 150 mM NaCl, pH 
7.5 (TN buffer), and lysed by incubation with 6 mg lysozyme for 30 minutes, 
followed by freezing for 18 hours at -20 °C. Lysate was thawed and made to 1 
mM MgCh then 1000 Kunitz units of DNAse were added with stirring, and 
incubated for 30 min. Volume was expanded to 500 ml with TN containing 0, 1 
% Triton X-100 (TNT buffer) and lysate stirred for 30 minutes. Insoluble 
inclusion bodies containing greater than 90% memapsin 2 protein were pelleted 
by centrifugation, and washed by resuspension in TNT with stirring for 1 -2 
hours. Following three additional TNT washes, the memapsin 2 inclusion 
bodies were dissolved in 40 ml of 8 M urea, 1 mM EDTA, 1 mM glycine, 100 
mM Tris base, 100 mM beta-mercaptoethanlol (8 M urea buffer). Optical 
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density at 280 run was measured, and volume expanded with 8 M urea buffer to 
achieve final O.D. near 0.5, with addition of sufficient quantity of beta- 
mercaptoethanol to attain 10 mM total, and 10 mM DTT, 1 mM reduced 
glutathione, 0.1 mM oxidized glutathione. The pH of the solution was adjusted 
to 10.0 or greater, and divided into four aliquots of 200 ml each. Each 200 ml 
was rapidly-diluted into 4 liters of 20 mM Tris base, with rapid stirring. The pH 
was adjusted immediately to 9.0, with 1 M HC1, and stored at 4 °C overnight. 
The following morning the diluted memapsin 2 solution was maintained at room 
temperature for 4-6 hours followed by adjusting pH to 8.5 and replacing the 
flasks to the 4 °C room. The same procedure was followed the next day with 
adj ustment of pH to 8 .0. 

This memapsin 2 solution was allowed to stand at 4 °C for 2-3 weeks. 
The total volume of approximately 16 liters was concentrated to 40 mis using 
ultra-filtration (Millipore) and stir-cells (Amicon), and centrifuged at 140,000 xg 
at 30 minutes in a rotor pre-equilbrated to 4 °C. The recovered supernatant was 
applied to a 2.5 x 100 cm column of S-300 equilibrated in 0.4 M urea, 20 mM 
Tris-HCl, pH 8.0, and eluted with the same buffer at 30 ml/hour. The active 
fraction of memapsin 2 was pooled and further purified in a FPLC using a 1 ml 
Resource-Q (Pharmacia) column. Sample was filtered, and applied to the 
Resource-Q column equilibrated in 0.4 M urea, 50 mM Tris-HCl, pH 8.0. 
Sample was eluted with a gradient of 0 - 1 M NaCl in the same buffer, over 30 
ml at 2 ml/min. The eluents containing memapsin 2 appeared near 0.4 M NaCl 
which was pooled for crystallization procedure at a concentration near 5 mg/ml. 

The amino-terminal sequence of the protein before crystallization 
showed two sequences starting respectively at residues 28p and 3 Op. Apparently, 
the pro peptide of recombinant pro-memapsin 2 had been cleaved during the 
preparation by a yet unidentified proteolytic activity. 

The activation of the folded pro-enzyme to mature enzyme, memapsin 2, 
was carried out as described above, i.e., incubation in 0. 1 M sodium acetate pH 
4.0 for 16 hours at 22 °C Activated enzyme was further purified using anion- 
exchange column chromatography on Resource-Q anion exchange column. The 
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purity of the enzyme was demonstrated by SDS-gel electrophoresis. At each 
step of the purification, the specific activity of the enzyme was assayed as 
described above to ensure the activity of the enzyme. 
Preliminary Crystallization with QM99-2 
5 Crystal trials were performed on purified memapsin 2 in complex with a 

substrate based transition-state inhibitor OM99-2 with a Ki = 10 nM. OM99-2 
is equivalent to eight amino-acid residues (including subsites S4, S3, S2, SI Sl\ 
S2\ S3' and S4 1 in a sequence EVNLAAEF) with the substitution of the peptide 
bond between the SI and SI' (L-A) by a transition-state isostere 
10 hydroxyethylene. Purified M2 was concentrated and mixed with 10 fold 
excessive molar amount of inhibitor. The mixture was incubated at room 
temperature for 2-3 hours to optimize the inhibitor binding. The crystallization 
trial was conducted at 20 °C using the hanging drop vapor diffusion procedure. 
A systemic search with various crystallization conditions was conducted to find 
1 5 the optimum crystallization conditions for memapsin 2/OM99-2 inhibitor 

complex. For the first step, a coarse screen aimed at covering a wide range of 
potential conditions were carried out using the Sparse Matrix Crystallization 
Screen Kits purchased from Hampton Research. Protein concentration and 
temperature were used as additional variables. Conditions giving promising 
20 (micro) crystals were subsequently used as starting points for optimization, 
using fine grids of pH, precipitants concentration etc. 

Crystals of memapsin-inhibitor complex were obtained at 30% PEG 
80Q0, 0.1 M NaCocadylate, pH 6,4. SDS gel electrophoresis of a dissolved 
crystal verified that the content of the crystal to be memapsin 2. Several single 
25 crystals (with the sizes about 0.3 mm x 0.2 mm x 0. 1 mm) were carefully 

removed from the cluster for data collection on a Raxis IV image plate. These 
results showed that the crystals diffract to 2.6 A. A4ypical^protein^diffraction 
paHemns^x)wn-in Figure-6- An X-ray image visualization and integration 
software-Denzo, was used to visualize and index the diffraction data. Denzo 
30 identified that the primitive orthorhombic lattice has the highest symmetry with 
a significantly low distortion index. The unit cell parameters were determined 
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as: a=89.1 A, b=96.6 A, c=134.1 A, a=p=y=90°. There are two memapsin 
2/OM99-2 complexes per crystallographic asymmetric unit, the V m of the crystal 
is 2.9 A 3 /Da. Diffraction extinctions suggested that the space group is P2i2i2j. 
With diffraction of the current crystal to 2.6 A, the crystal structure 
5 obtained from these data has the potential to reach atomic solution, i.e., the 

three-dimensional positions of atoms and chemical bonds in the inhibitor and in 
memapsin 2 can be deduced. Since memapsin 2 sequence is homologous with 
other mammalian aspartic proteases, e.g., pepsin or cathepsin D, it is predicted 
that the three dimensional structures of memapsin 2 will be similar (but not 
10 identical) to their structures. Therefore, in the determination of x-ray structure 
from the diffraction data obtained from the current crystal, it is likely the 
solution of the phase can be obtained from the molecular replacement method 
using the known crystal structure of aspartic proteases as the search model. 
Further Crystallization Studies 
15 Concentrated memapsin 2 was mixed with 1 0-fold molar excessive of 

the inhibitor. The mixture was incubated at room temperature for 2-3 hours to 
optimize inhibitor binding, and then clarified with a 0.2 micron filter using 
centrifiigation. Crystals of memapsin 2-inhibitor complex were grown at 20 °C 
by hanging drop vapor diffusion method using equal volumes of enzyme- 
20 inhibitor and well solution. Crystals of quality suitable for diffraction studies 
were obtained in two weeks in 0.1 M sodium cacodylate, pH 7.4, 0.2 M 
(NH4) 2 S0 4 , and 22.5% PEG8000. The typical size of the crystals was about 0.4 
x 0.4 x 0.2 mm 3 . 

Diffraction data were measured on a Raxis-IV image plate with a 
25 Rigaku X-ray generator, processed with the HKL program package [Z. 

Otwinowski, W. Minor, Methods Enzymol. 276, 307 (1997)] A single crystal 
of approximately 0.4 x 0.4 x 0.2 mm 3 in size was treated with a cryo-protection 
solution of 25% PEG8000, 20% glycerol, 0.1 M sodium-cacodylate pH 6.6, and 
0,2 M (NHL^SO^and then flash-cooled with liquid nitrogen to about -180 °C 
30 for data collection. Diffraction was observed to al least 1 .9 A. The crystal form 
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belongs to space group P2j with two memapsin 2/OM99-2 complexes per 
crystallographic asymmetric unit and 56% solvent content. 

Molecular replacement was performed with data in the range of 15.0- 
3.5 A using program AmoRe, CCP4 package [Navaza, J., Acta Crystallog. 
5 Sect. A. 50, 157 (1994)]. Pepsin, a human aspartic protease with 22% sequence 
identity, was used as the search model(PDB id lpsn). Rotation and translation 
search, followed by rigid body refinement, identified a top solution and 
positioned both molecules in the asymmetric unit. The initial solution had a 
correlation coefficient of 22% and an R-factor of 0.5 1 . The refinement was 
10 carried out using the program CNS [Brunger et aL, Acta Crystallogr, Sect. D, 
54, 905 (1998)]. 10% of reflections were randomly selected prior to refinement 
for Rfree monitoring [Bruger, AT., X-PLOR Version 3. 1 : A system for X-ray 
Crystallography and NMR, Yale University Press, New Haven, CT (1992)]. 
Molecular graphics program [Jones, T.A., et aL, Improved methods for 
1 5 building protein models in electron denisty maps and location of errors in these 
models. Acta Crystallogr. Sect. A 47, 1 10 (1991)] was used for map display 
and model building. From the initial pepsin model, corresponding amino acid 
residues were changed to that of memapsin 2 according to sequence alignment. 
The side chain conformations were decided by the initial electron density map 
20 and a rotomer library. This model was refined using molecular dynamics and 
energy minimization function of CNS [Bruger, A.T., et al., Acta Crystallogr. 
Sect. D, 54, 905 (1998)]. The first cycle of refinement dropped the Rworking to 
41% and the R free to 45%. At this stage, electron densities in the omit map 
clearly showed the inhibitor configuration in the active site cleft. Structural 
25 features unique to memapsin 2 in chain tracing, secondary structure, insertions, 
deletions and extensions (as compared to the search model) are identified and 
constructed in subsequent iterations of crystallographic refinement and map 
fitting. The inhibitor was built into the corresponding electron density. 

About 440 solvent molecules were then gradually added to the structure 
30 as identified in the |Fo|-JFc| map contoured at the 3 sigma level. Non- 

crystallographic symmetry restriction and averaging were used in early stages 
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of refinement and model building. Bulk solvent and anisotropic over-all B 
factor corrections were applied through the refinement. The final structure was 
validated by the program PROCHECK Laskowski, R.A. et ah, J. AppL 
Crystallog. 26, 283 (1993) which showed that 95% of the residues are located 
in the most favored region of the Ramachandran plot. All the main chain and 
side chain parameters are within or better than the standard criteria. The final 
Rworking and Rfree are 18% and 22% respectively. Refinement statistics are listed 
in Table 2. 
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Table 2, Data Collection and Refinement Statistics 



A. Data Statistics 

Space group P2i 

5 Unit Cell (a, b, and c in ) 53.7, 85.9, 109.2 

(, ,and in degrees) 90.0,101.4,90.0 

Resolution ( ) 25.0-1.9 

Number of observed reflections 1 44, 1 64 

Number of unique reflections 69,056 

10 R merg e a °' 061 (°- 25 ) 

Data completeness (%) (25,0-1.9 ) 90.0 (68.5) 

<I/(I)> 13.7(3.0) 
B, Refinement Statistics 

Rworking 0.186 

15 R free b 0.228 
RMS deviation from ideal values 

Bond length ( ) 0.014 

Bond angle (Deg) 1-7 

Number of water molecules 445 
20 Average B-factor ( ) 

Protein 28 - 5 

Solvent 32 - 2 



a R^ncrge = hki i I Iwd, i - <Ihki> I / hki <hki>, where I hWi i is the intensity of the 
25 ith measurement and <Ihki> is the weighted mean of all measurements of Ihki- 

b Rworkiag (free) = I F 0 H F c j / 1 F 0 |, where F 0 and F c are the observed and 
calculated structure factors. Numbers in parentheses are the corresponding 
numbers for the highest resolution shell (2.00-1.9 A). Reflections with 
F 0 / (F o )>=0.0 are included in the refinement and R factor calculation. 
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